4 research outputs found
Overview of BioASQ 2021-MESINESP track. Evaluation of advance hierarchical classification techniques for scientific literature, patents and clinical trials
CLEF 2021 – Conference and Labs of the Evaluation Forum, September 21–24, 2021, Bucharest, Romania
There is a pressing need to exploit recent advances in natural language processing technologies, in
particular language models and deep learning approaches, to enable improved retrieval, classification
and ultimately access to information contained in multiple, heterogeneous types of documents. This is
particularly true for the field of biomedicine and clinical research, where medical experts and scientists
need to carry out complex search queries against a variety of document collections, including literature,
patents, clinical trials or other kinds of content such as EHRs. Indexing documents with structured controlled
vocabularies used for semantic search engines and query expansion purposes is a critical task for enabling
sophisticated user queries and even cross-language retrieval. Due to the complexity of the medical domain
and the use of very large hierarchical indexing terminologies, implementing efficient automatic systems
to aid manual indexing is extremely difficult. This paper provides a summary of the MESINESP task
results on medical semantic indexing in Spanish (BioASQ/ CLEF 2021 Challenge). MESINESP was carried
out in direct collaboration with literature content databases and medical indexing experts using the DeCS
vocabulary, a resource similar to MeSH. Seven participating teams used advanced technologies
including extreme multilabel classification and deep language models to solve this challenge, which can
be viewed as a multi-label classification problem. As MESINESP resources, we have released a Gold Standard
collection of 243,000 documents with a total of 2179 manual annotations divided into training, development
and test subsets covering literature, patents as well as clinical trial summaries, under a cross-genre
training and data labeling scenario. Manual indexing of the evaluation subsets was carried out by three
independent experts using a specially developed indexing interface called ASIT. Additionally, we have
published a collection of large-scale automatic semantic annotations based on NER systems of these
documents with mentions of drugs/medications (170,000), symptoms (137,000), diseases (840,000) and
clinical procedures (415,000). In addition to a summary of the technologies used by the teams, this paper
SciELO: uma metodologia para publicação eletrônica
Describes the SciELO Methodology (Scientific Electronic Library Online) for the electronic publication of scientific journals, addressing topics such as the transition from print publication to electronic publication, the scientific communication process, the principles that guided the development of the methodology, its application on the SciELO site, its modules and components, the instruments on which it is based, etc. The article also discusses the potential and trends for the field in Brazil and Latin America, pointing out issues and proposals that the methodology will need to address and resolve. It concludes that the SciELO Methodology is an efficient, flexible and comprehensive solution for electronic scientific publishing.
SciELO: una metodología para la publicación electrónica
Describes the SciELO Methodology (Scientific Electronic Library Online) for the electronic publishing of scientific periodicals, examining issues such as the transition from traditional print publication to electronic publishing, the scientific communication process, the principles that guided the development of the methodology, its application in building the SciELO site, its modules and components, the tools used for its construction, etc. The article also discusses the potential and trends for the area in Brazil and Latin America, pointing out questions and proposals that the methodology should investigate and resolve. It concludes that the SciELO Methodology is an efficient, flexible and comprehensive solution for electronic scientific publishing.
SciELO: una metodología para la publicación electrónica
The SciELO Methodology (Scientific Electronic Library Online) for the electronic dissemination of scientific periodicals is described. Topics addressed include the transition from print publication on paper to electronic publication, the scientific communication process, the principles that guided the development of the methodology, its application on the SciELO site, its modules and components, the instruments on which it is based, and so on. The potential and trends available in this area in Brazil and Latin America are also discussed, and issues and proposals that the methodology will need to address and resolve are indicated. It is concluded that the SciELO Methodology is an efficient, flexible and comprehensive solution for electronic scientific publishing.